Abstract

Measuring the similarity of nodes in complex networks has attracted the
interest of many researchers. In this paper, a new method based on degree
centrality and relative entropy is proposed to measure the similarity of
nodes in complex networks. The results show that nodes with a common
structural property have a high similarity to other nodes. Nodes that are
highly influential have a low similarity to other nodes, and marginal nodes
also have a low similarity to other nodes. The results show that the
proposed method is useful and reasonable for measuring the similarity of
nodes in complex networks.
1. Introduction


Complex networks offer a new, mathematical way to describe complex systems.
Many real-world systems can be modeled as complex networks, such as
biological, social and technological systems [1, 2, 3, 4, 5]. Many
properties of complex networks have been explored by researchers in this
field, such as the network topology and dynamics, the property of the
network structure [10], the self-similarity and fractal property of complex
networks [11, 12, 13], evolutionary games on complex networks [15], the
controllability and synchronization of complex networks [16, 17] and so on
[18, 19, 20].

The similarity of nodes in complex networks is a new research direction.
Typical questions are "How similar are these two vertices?" and "Which node
is most similar to the other nodes?". Many methods have been proposed to
solve this problem [24, 25, 26, 27, 28]. In this paper, a new method based
on the relative entropy (Kullback-Leibler divergence) [29] is proposed to
describe the similarity of nodes in complex networks. The definition of the
probabilities of each node is based on the degree distribution.

The rest of this paper is organised as follows. Section 2 introduces some
preliminaries of this work. In Section 3, a new method to measure the
similarity of nodes in complex networks is proposed. The application of the
proposed method is illustrated in Section 4. Conclusions are given in
Section 5.


2. Preliminaries 

2.1. Local network in the complex network 

Existing research on complex networks makes it clear that many properties
of complex networks depend on their structure [2]. In a complex network,
each node's influence on the whole network is determined by its neighbour
nodes. Based on existing research on the local structure of complex
networks [25, 30], a local network of each node in the complex network is
proposed [31]. The details of the local network are as follows:

Each local network of a target node contains the target node and the
neighbour nodes of the target node.
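
The definition of the local network can be illustrated with a minimal Python sketch. The adjacency dictionary below is a hypothetical toy graph (it is not Network A), and the function name `local_network` is an assumption made only for this illustration.

```python
def local_network(adjacency, node):
    """Return the node set of LN(node): the target node plus its neighbours."""
    return {node} | set(adjacency[node])


# Hypothetical toy graph, stored as node -> list of neighbour nodes.
adjacency = {
    1: [2, 3],
    2: [1, 3],
    3: [1, 2, 4],
    4: [3],
}

ln_3 = local_network(adjacency, 3)  # node 3 and its three neighbours
ln_4 = local_network(adjacency, 4)  # node 4 and its single neighbour
```

A node with degree k therefore has a local network with k + 1 nodes.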

2.2. Relative entropy (Kullback-Leibler divergence)

The relative entropy (Kullback-Leibler divergence) is a basic concept in
probability theory and information theory. It was proposed by Kullback and
Leibler [29]. The relative entropy is a non-symmetric measure of the
difference between two probability distributions. For two probability
distributions P and Q, the relative entropy is defined in Eq. (1).

$$D_{KL}(P \| Q) = \sum_{i=1}^{n} P(i) \ln \frac{P(i)}{Q(i)} \qquad (1)$$

where P and Q have the same number of components, and the number of
components in each of the two distributions is equal to n.
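
As a hedged illustration of Eq. (1), the following Python sketch computes the relative entropy of two small distributions and shows that the measure is not symmetric. The natural logarithm is assumed, and the two distributions `p` and `q` are made-up values chosen only for this example.

```python
import math


def kl_divergence(p, q):
    """Relative entropy D_KL(P || Q) of Eq. (1), natural logarithm assumed."""
    if len(p) != len(q):
        raise ValueError("P and Q must have the same number of components")
    # A term with P(i) = 0 contributes nothing (0 * ln 0 is taken as 0).
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


# Made-up distributions with strictly positive components.
p = [0.5, 0.5]
q = [0.9, 0.1]

d_pq = kl_divergence(p, q)  # D_KL(P || Q)
d_qp = kl_divergence(q, p)  # D_KL(Q || P), a different value
```

Because the two directions differ, a symmetric quantity has to combine both of them.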
Figure 1: Network A, shown in subfigure (a), is a simple network. Subfigure
(b) shows the details of the local network of node 4. Subfigure (c) shows
the details of the local network of node 8.

3. Measuring the similarity of each node


The proposed method is based on the definition of the local network and the
relative entropy. The definition of the proposed method can be divided into
two parts.

Part 1 The definition of the probabilities of each node. First, calculate
the degree of each node and find the maximum degree in the network.
Second, set the number of components of each node's probabilities based
on the value of the maximum degree. Third, use the degrees of the node
and its neighbour nodes as the components of the probabilities. Finally,
sort the probabilities from high to low.

Part 2 The relative entropy of each node to the other nodes. Calculate
the relative entropy between the probabilities of each pair of nodes.

Based on the local network of each node and the degree centrality, the
probabilities of each node are defined as follows. Let LN(i) denote the
local network of node i. In the local network LN(i), the total degree is
denoted by T_degree(i), where i refers to the i-th node. The number of
nodes in the local network LN(i) is equal to k. The maximum degree in the
whole network is equal to max_n. Then the number of components of each
node's probabilities is equal to max_n + 1. The probabilities of node i are
defined in Eq. (2).

$$P(i) = [d(1), d(2), \ldots, d(k), 0, 0, \ldots, d(max_n + 1)] \qquad (2)$$
where d(j) in Eq. (2) is defined based on the degree of the j-th node in
the local network.

In P(i), the value of d(j) is the degree of the j-th node in the local
network LN(i) divided by the total degree T_degree(i). If the number of
nodes in the local network is smaller than max_n + 1, the remaining
components d(j) are set to 0. Finally, the components of P(i) are sorted
from high to low.

An example of the definition of P(i) is shown in Fig. 2.



Figure 2: A part of Network A shown in Fig. 1. The maximum degree in
Network A is equal to 6, so max_n for LN(4) is 6 and the number of
components in the probabilities is 7. The total degree in LN(4) is equal to
23. The degree of node 1 is 2, the degree of node 2 is 2, the degree of
node 3 is 3, the degree of node 5 is 4, the degree of node 6 is 4, the
degree of node 7 is 2 and the degree of node 4 is 6. Then
P(4) = [d(1), d(2), d(3), d(4), d(5), d(6), d(7)] =
[2/23, 2/23, 3/23, 6/23, 4/23, 4/23, 2/23]. After sorting the components,
P(4) = [6/23, 4/23, 4/23, 3/23, 2/23, 2/23, 2/23].
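
The construction of P(4) in Fig. 2 can be reproduced with a short Python sketch. The function name `probability_vector` and its parameters are assumptions introduced for illustration; the degree values are the ones listed in the caption of Fig. 2, and exact fractions are used so that the result can be compared with the caption directly.

```python
from fractions import Fraction


def probability_vector(degrees_in_ln, max_degree):
    """Build the sorted, zero-padded probabilities P(i) of Eq. (2).

    degrees_in_ln: degrees of all nodes in LN(i), target node included.
    max_degree:    maximum degree in the whole network (max_n).
    """
    total = sum(degrees_in_ln)  # T_degree(i)
    probs = [Fraction(d, total) for d in degrees_in_ln]
    # Pad with zeros up to max_n + 1 components.
    probs += [Fraction(0)] * (max_degree + 1 - len(probs))
    return sorted(probs, reverse=True)


# Degrees of nodes 1, 2, 3, 4, 5, 6, 7 in LN(4), as given in Fig. 2.
p4 = probability_vector([2, 2, 3, 6, 4, 4, 2], max_degree=6)

# Hypothetical node of degree 1 whose only neighbour has degree 3.
p_small = probability_vector([1, 3], max_degree=6)
```

The second call shows the zero padding: a local network with only two nodes still yields a vector with max_n + 1 = 7 components.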
Then the measure of the similarity of node i and node j is defined as
follows:

$$S_{ij} = 1 - (D_{KL}(P(i) \| P(j)) + D_{KL}(P(j) \| P(i))) \qquad (3)$$

The sum of each node's similarity to the other nodes in the network is used
to identify which node is most similar to the other nodes. The larger the
sum of similarities, the more similar the node is to the other nodes.
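
A minimal Python sketch of Eq. (3) is given below. One point is an assumption of this sketch and is not stated in the text: Eq. (1) is undefined when a component of the second distribution is zero, so any term in which either component is zero is skipped here. The natural logarithm is also assumed. The vector `p_a` is P(4) from Fig. 2, while `p_b` is a hypothetical zero-padded vector used only for comparison.

```python
import math


def kl_divergence(p, q):
    """D_KL(P || Q) with the natural logarithm.

    Assumption: terms where either component is zero are skipped, because
    the zero padding of Eq. (2) would otherwise make Eq. (1) undefined.
    """
    return sum(a * math.log(a / b) for a, b in zip(p, q) if a > 0 and b > 0)


def similarity(p_i, p_j):
    """S_ij = 1 - (D_KL(P(i) || P(j)) + D_KL(P(j) || P(i))), as in Eq. (3)."""
    return 1.0 - (kl_divergence(p_i, p_j) + kl_divergence(p_j, p_i))


p_a = [6 / 23, 4 / 23, 4 / 23, 3 / 23, 2 / 23, 2 / 23, 2 / 23]  # P(4), Fig. 2
p_b = [3 / 4, 1 / 4, 0, 0, 0, 0, 0]                             # hypothetical

s_aa = similarity(p_a, p_a)  # identical probabilities give a similarity of 1
s_ab = similarity(p_a, p_b)
s_ba = similarity(p_b, p_a)  # same value as s_ab: Eq. (3) is symmetric
```

Adding the two directions of the relative entropy makes the measure symmetric, so the resulting similarity matrix is symmetric with ones on the diagonal.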

To illustrate the usefulness of the new method, an example network
(Network A-21) is used to measure the similarity of its nodes. The details
of the example network (Network A-21) are shown in Fig. 3.



The probabilities of each node in the example network (Network A-21)
are shown in Table 1.

The similarity matrix S_{ij} of the nodes in the example network
(Network A-21) is shown in Eq. (4):
Table 1: The probabilities (P(i)) of each node in the Network A-21

P(1)  = [0.27 0.27 0.27 0.18 0.00 0.00 0.00]
P(2)  = [0.31 0.31 0.19 0.19 0.00 0.00 0.00]
P(3)  = [0.36 0.21 0.21 0.21 0.00 0.00 0.00]
P(4)  = [0.44 0.33 0.22 0.00 0.00 0.00 0.00]
P(5)  = [0.23 0.23 0.14 0.14 0.14 0.14 0.00]
P(6)  = [0.36 0.27 0.27 0.09 0.00 0.00 0.00]
P(7)  = [0.24 0.24 0.14 0.14 0.14 0.10 0.00]
P(8)  = [0.31 0.31 0.19 0.19 0.00 0.00 0.00]
P(9)  = [0.75 0.25 0.00 0.00 0.00 0.00 0.00]
P(10) = [0.29 0.21 0.21 0.14 0.14 0.00 0.00]
P(11) = [0.45 0.36 0.18 0.00 0.00 0.00 0.00]
P(12) = [0.33 0.28 0.22 0.17 0.00 0.00 0.00]
P(13) = [0.50 0.33 0.17 0.00 0.00 0.00 0.00]
P(14) = [0.40 0.40 0.20 0.00 0.00 0.00 0.00]
P(15) = [0.24 0.16 0.16 0.12 0.12 0.12 0.08]
P(16) = [0.44 0.33 0.22 0.00 0.00 0.00 0.00]
P(17) = [0.38 0.25 0.19 0.19 0.00 0.00 0.00]
P(18) = [0.35 0.24 0.18 0.12 0.12 0.00 0.00]
P(19) = [0.35 0.24 0.18 0.12 0.12 0.00 0.00]
P(20) = [0.33 0.27 0.20 0.20 0.00 0.00 0.00]
P(21) = [0.43 0.21 0.21 0.14 0.00 0.00 0.00]

$$S_{ij} = \left[\begin{array}{*{21}{c}}
1.00 & 0.96 & 0.94 & 0.89 & 0.88 & 0.91 & 0.90 & 0.96 & 0.52 & 0.96 & 0.84 & 0.98 & 0.80 & 0.88 & 0.85 & 0.89 & 0.93 & 0.90 & 0.90 & 0.96 & 0.89 \\
0.96 & 1.00 & 0.95 & 0.95 & 0.91 & 0.88 & 0.94 & 1.00 & 0.60 & 0.94 & 0.94 & 0.99 & 0.91 & 0.96 & 0.84 & 0.95 & 0.97 & 0.94 & 0.94 & 0.99 & 0.91 \\
0.94 & 0.95 & 1.00 & 0.93 & 0.87 & 0.87 & 0.89 & 0.95 & 0.70 & 0.96 & 0.89 & 0.97 & 0.89 & 0.88 & 0.87 & 0.93 & 0.99 & 0.93 & 0.93 & 0.98 & 0.96 \\
0.89 & 0.95 & 0.93 & 1.00 & 0.77 & 0.96 & 0.80 & 0.95 & 0.82 & 0.88 & 0.99 & 0.96 & 0.98 & 0.98 & 0.73 & 1.00 & 0.96 & 0.93 & 0.93 & 0.95 & 0.95 \\
0.88 & 0.91 & 0.87 & 0.77 & 1.00 & 0.81 & 0.98 & 0.91 & 0.37 & 0.95 & 0.77 & 0.90 & 0.74 & 0.78 & 0.97 & 0.77 & 0.89 & 0.93 & 0.93 & 0.90 & 0.84 \\
0.91 & 0.88 & 0.87 & 0.96 & 0.81 & 1.00 & 0.83 & 0.88 & 0.72 & 0.93 & 0.92 & 0.94 & 0.89 & 0.93 & 0.82 & 0.96 & 0.90 & 0.95 & 0.95 & 0.89 & 0.94 \\
0.90 & 0.94 & 0.89 & 0.80 & 0.98 & 0.83 & 1.00 & 0.94 & 0.41 & 0.96 & 0.80 & 0.92 & 0.77 & 0.81 & 0.95 & 0.80 & 0.91 & 0.94 & 0.94 & 0.93 & 0.86 \\
0.96 & 1.00 & 0.95 & 0.95 & 0.91 & 0.88 & 0.94 & 1.00 & 0.60 & 0.94 & 0.94 & 0.99 & 0.91 & 0.96 & 0.84 & 0.95 & 0.97 & 0.94 & 0.94 & 0.99 & 0.91 \\
0.52 & 0.60 & 0.70 & 0.82 & 0.37 & 0.72 & 0.41 & 0.60 & 1.00 & 0.55 & 0.81 & 0.66 & 0.87 & 0.71 & 0.38 & 0.82 & 0.74 & 0.70 & 0.70 & 0.66 & 0.81 \\
0.96 & 0.94 & 0.96 & 0.88 & 0.95 & 0.93 & 0.96 & 0.94 & 0.55 & 1.00 & 0.84 & 0.97 & 0.82 & 0.84 & 0.95 & 0.88 & 0.95 & 0.97 & 0.97 & 0.96 & 0.94 \\
0.84 & 0.94 & 0.89 & 0.99 & 0.77 & 0.92 & 0.80 & 0.94 & 0.81 & 0.84 & 1.00 & 0.93 & 0.99 & 0.99 & 0.69 & 0.99 & 0.94 & 0.92 & 0.92 & 0.93 & 0.91 \\
0.98 & 0.99 & 0.97 & 0.96 & 0.90 & 0.94 & 0.92 & 0.99 & 0.66 & 0.97 & 0.93 & 1.00 & 0.91 & 0.94 & 0.87 & 0.96 & 0.98 & 0.96 & 0.96 & 0.99 & 0.96 \\
0.80 & 0.91 & 0.89 & 0.98 & 0.74 & 0.89 & 0.77 & 0.91 & 0.87 & 0.82 & 0.99 & 0.91 & 1.00 & 0.96 & 0.68 & 0.98 & 0.94 & 0.91 & 0.91 & 0.91 & 0.92 \\
0.88 & 0.96 & 0.88 & 0.98 & 0.78 & 0.93 & 0.81 & 0.96 & 0.71 & 0.84 & 0.99 & 0.94 & 0.96 & 1.00 & 0.69 & 0.98 & 0.93 & 0.90 & 0.90 & 0.93 & 0.88 \\
0.85 & 0.84 & 0.87 & 0.73 & 0.97 & 0.82 & 0.95 & 0.84 & 0.38 & 0.95 & 0.69 & 0.87 & 0.68 & 0.69 & 1.00 & 0.73 & 0.87 & 0.93 & 0.93 & 0.87 & 0.85 \\
0.89 & 0.95 & 0.93 & 1.00 & 0.77 & 0.96 & 0.80 & 0.95 & 0.82 & 0.88 & 0.99 & 0.96 & 0.98 & 0.98 & 0.73 & 1.00 & 0.96 & 0.93 & 0.93 & 0.95 & 0.95 \\
0.93 & 0.97 & 0.99 & 0.96 & 0.89 & 0.90 & 0.91 & 0.97 & 0.74 & 0.95 & 0.94 & 0.98 & 0.94 & 0.93 & 0.87 & 0.96 & 1.00 & 0.96 & 0.96 & 0.99 & 0.97 \\
0.90 & 0.94 & 0.93 & 0.93 & 0.93 & 0.95 & 0.94 & 0.94 & 0.70 & 0.97 & 0.92 & 0.96 & 0.91 & 0.90 & 0.93 & 0.93 & 0.96 & 1.00 & 1.00 & 0.95 & 0.97 \\
0.90 & 0.94 & 0.93 & 0.93 & 0.93 & 0.95 & 0.94 & 0.94 & 0.70 & 0.97 & 0.92 & 0.96 & 0.91 & 0.90 & 0.93 & 0.93 & 0.96 & 1.00 & 1.00 & 0.95 & 0.97 \\
0.96 & 0.99 & 0.98 & 0.95 & 0.90 & 0.89 & 0.93 & 0.99 & 0.66 & 0.96 & 0.93 & 0.99 & 0.91 & 0.93 & 0.87 & 0.95 & 0.99 & 0.95 & 0.95 & 1.00 & 0.94 \\
0.89 & 0.91 & 0.96 & 0.95 & 0.84 & 0.94 & 0.86 & 0.91 & 0.81 & 0.94 & 0.91 & 0.96 & 0.92 & 0.88 & 0.85 & 0.95 & 0.97 & 0.97 & 0.97 & 0.94 & 1.00
\end{array}\right] \qquad (4)$$
Figure 4: The similar nodes of each node (from node 1 to node 10).

Figure 5: The similar nodes of each node (from node 11 to node 20).

From the similarity matrix, we find that node 2 and node 8, node 4 and
node 16, and node 18 and node 19 have the same structure in the example
network (Network A-21). The details are shown in Fig. 6.

From the similarity matrix, we also find that node 9 has the lowest
similarity to the other nodes and node 12 has the highest similarity to the
other nodes.

The results of our test on the example network (Network A-21) indicate that
measuring the similarity of nodes based on the relative entropy is a
reasonable and useful method. The method can also be used for node
classification in complex networks. Node 12 has the highest similarity to
the other nodes. The degree of node 12 is equal to 3, and most nodes in the
example network (Network A-21) have a degree equal to 3. This also shows
that the degree is very important for describing the structural property of
complex networks. Node 9 is a marginal node, because no node has a high
similarity to it.
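
The ranking by summed similarity can be sketched in Python as follows. For brevity, the sketch uses only a three-node excerpt of the similarity matrix (the entries for nodes 2, 9 and 12), so it illustrates the procedure and does not reproduce the full 21-node ranking. The function name `total_similarity` is an assumption made for this illustration.

```python
def total_similarity(labels, matrix):
    """Sum each node's similarity to all other nodes (diagonal excluded)."""
    totals = {}
    for row, label in enumerate(labels):
        totals[label] = sum(
            value for col, value in enumerate(matrix[row]) if col != row
        )
    return totals


# Excerpt of the similarity matrix for nodes 2, 9 and 12 only.
labels = [2, 9, 12]
excerpt = [
    [1.00, 0.60, 0.99],
    [0.60, 1.00, 0.66],
    [0.99, 0.66, 1.00],
]

totals = total_similarity(labels, excerpt)
most_similar = max(totals, key=totals.get)   # node with the largest sum
least_similar = min(totals, key=totals.get)  # node with the smallest sum
```

Within this excerpt the largest sum belongs to node 12 and the smallest to node 9, which agrees with the observation made on the full matrix.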


4. Application 


In this section, the new method is used to find the most similar node in
four real networks. The four networks are Zachary's Karate Club network
(Karate) [32], the US-airport network (Us-airport) [33], the Email network
(Email) [33] and the Germany highway network (Highway) [34]. The results
are as follows:
Figure 6: The details of the nodes that have the same structure.
Table 2: The most similar node and the most marginal node in the four real networks

Network      Nodes   Edges   High similarity node   Low similarity node
Karate       34      78      28                     12
Us-airport   332     2126    55                     118
Email        1133    10902   855                    644
Highway      1168    2481    31                     798


5. Conclusion 

Measuring the similarity of nodes in complex networks is an interesting
topic. In this paper, a new method based on the relative entropy is
proposed to measure the similarity of nodes in complex networks. Nodes with
a common structure have a high similarity to other nodes. When the
similarity between two nodes is equal to 1, those two nodes have the same
structural property in the complex network. Nodes that are influential and
nodes that are marginal in the complex network have a low similarity to
other nodes. The results in this paper show that the proposed method is
useful and reasonable for measuring the similarity of nodes in complex
networks.